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Abstract 

We develop tools for analyzing focused stochastic local search algorithms. These are algorithms 
which search a state space probabilistically by repeatedly selecting a constraint that is violated in the 
current state and moving to a random nearby state which, hopefully, addresses the violation without 
introducing many new ones. A large class of such algorithms arise from the algorithmization of the 
Lovasz Local Lemma, a non-constructive tool for proving the existence of satisfying states. Here we 
give tools that provide a unified analysis of such algorithms and of many more, expressing them as 
instances of a general framework. 
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1 Introduction 


Let rj be a large finite set of objeets and let F = {/i, / 2 ,..., /m} be a eolleetion of subsets of Q. We will 
refer to eaeh fi £ F as a flaw to express that its elements have negative feature i G [m]. For example, given a 
CNF formula on n variables with elauses ci, C 2 ,..., Cm, we ean define for eaeh elause q the flaw (subeube) 
fi C {0,1}” whose elements violate Cj. Following linguistie rather than mathematieal eonvention we say 
that / is present in a if f 3 a and that a £ Q is flawless (perfeet) if no flaw is present in a. 

Our goal is to develop tools for analyzing stoehastie loeal seareh algorithms for finding perfeet objeets. 
Importantly, our analysis will not assume that 0 eontains perfeet objeets, but rather will establish their 
existenee by proving that an algorithm eonverges (quiekly) to one. The general idea in stoehastie loeal 
seareh is that O is equipped with a neighborhood strueture and that the seareh starts at some element (state) 
of O and moves from state to state along the neighborhood strueture. Focused loeal seareh eorresponds to 
the ease where eaeh state ehange ean be attributed to an effort to rid the state of some speeifie present flaw. 

Coneretely, for eaeh a £ Q., let U{a) = {f £ F : a £ /}, i.e., U{a) is the set of flaws present in a. 
For every flaw fi £ U (a), let A{i, a) / {a} be a non-empty subset of Q. We eall the elements of A{i, a) 
actions and we eonsider the multi-digraph Donfl whieh has an are cr r for every r £ A{i, a). We will 
eonsider walks on D whieh start at a state cji, seleeted aeeording to some probability distribution 9, and 
whieh at eaeh non-sink vertex a first seleet a flaw fi 9 a, as a funetion of the trajeetory so far, and then 
seleet a next state r G A{i, a) with probability pi(a, r). Whenever a flaw 9 ais seleeted we will say flaw 
fi was addressed (whieh will not neeessarily mean that /* was eliminated, i.e., potentially A{i, cr) n /* / 0). 

A large elass of algorithms for dealing with the setting at this level of generality arise by algorithmiza¬ 
tions of the Lovasz Loeal Lemma (LLL). This is a non-eonstruetive tool for proving the existence of flawless 
objeets by introdueing a probability measure p on 17, along the lines of the Probabilistie Method (throughout 
we assume that produets devoid of faetors evaluate to 1, i.e., n x&id9{x) = 1 for any 5 ). 

General LLL. Let A = {Ai,A 2 ,..., Am} be a set ofm events. For each i £ [m], let D{i) C [m] \ {i} be 
such that ii{Ai \ for every S C [m] \ {D{i) U {i}). If there exist positive real numbers 

{'fi}fLi such that for all i £ [m\, 


tpi 


SC{i}UD(i)iGS 


( 1 ) 


then the probability that none of the events in A occurs is at least 02=11/(1 + V’i) > 0- 

Remark 1, Condition (1) above is equivalent to the more well-known form p{Ai) < Xi njG£)(i)(l “ Xj), 
where Xi = fii/if F 'fi). As we will see, formulation facilitates refinements. 

Erdos and Speneer [6] noted that independenee in the LLL ean be replaeed by negative correlation, 
yielding the stronger Lopsided LLL. The differenee is that eaeh set D{i) is replaeed by a set L{i) C [m] \{f} 
sueh that p{Ai \ Cij^s^j) < for every 5 C [m] \ {L{i) U {f}), i.e., “=” is replaeed by “<”. 

In a landmark work [17], Moser and Tardos made the general LLL eonstruetive for product measures 
over explieitly presented variables. Speeifieally, in the variable setting of [17], eaeh event Ai is determined 
by a set of variables vbl(ylj) so that j £ D{i) iff vbl(yli) n vbl(Aj) / 0. Moser and Tardos proved 
that if (1) holds, then repeatedly seleeting any oeeurring event Ai (flaw present) and resampling every 
variable in vbl(Aj) independently of all others, leads to a flawless objeet after a linear expeeted number of 
resamplings. Pegden [20] extended the result of [17] to the eluster expansion eriterion of Bissaeott et al. [3], 
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and Kolipaka and Szegedy [13] extended it to Shearer’s eriterion (the most general LLL eriterion). Beyond 
the variable setting, Harris and Srinivasan in [11] algorithmized the general LLL for the uniform measure on 
permutations while, very reeently, Harvey and Vondrak [12] algorithmized the Lopsided LLL up to Shearer’s 
eriterion assuming effieiently implementable resampling oracles. Resampling oraeles, introdueed in [12], 
elegantly eapture a common constraint in all prior algorithmizations of LLL, namely that state transitions 
must be “compatible” with the measure p. Below we give the part of the definition of resampling oracles 
that exactly expresses this notion of compatibility, which we dub (measure) regeneration. 


Regeneration. Resampling Oracles] Say that {D, p) regenerate p at flaw /j if for every r G fl, 


1 

hifi) 


^ p{a)pi{a,T) = p{t) . 


( 2 ) 


Observe that the l.h.s. of (2) is the probability of reaching r after first sampling a state a £ fi according 
to p and then addressing f at a according to {D, p). The requirement that this probability equals p{t) for 
every r G 0 means that in every state a £ fi the distribution of actions for addressing f must be such that it 
removes the conditional f 3 a. A trivial way to satisfy this (very stringent) requirement is to sample a new 
state a' according to p in each step (assuming p is efficiently sampleable). Doing this, though, removes any 
sense of progress, as the set of flaws present in a' are completely unrelated to those in a. Instead, we would 
like to achieve (2) while limiting the set of flaws that may be present in a' that were not present in a. For 
example, note that in the variable setting resampling every variable in vbl(/i) independently satisfies (2), 
while only having the potential to introduce flaws that share at least one variable with /j. It is, thus, natural 
to consider the following “projection” of the action digraph introduced in the flaws/actions framework of [1]. 

Potential Causality. For an arc a ^ t in D and a flaw fj present in r we say that fi causes fj if f = fj 
or fj ^ a. If D contains any arc in which fi causes fj we say that fi potentially causes fj. 

Potential Causality Digraph. The digraph C = C{Q,, F, D) on [m\ where i ^ j iff fi potentially causes 
fj is called the potential causality digraph. The neighborhood of a flaw fi w r(z) = {j : i —)• J exists in C}. 

In the interest of brevity we will call C the causality digraph, instead of the potential causality digraph. It 
is important to note that C contains an arc f —)■ j if there exists even one state transition aimed at addressing 
fi that causes fj to appear in the new state. 

As mentioned, very recently, Harvey and Vondrak [12] made the Lopsided LLL algorithmic, given 
resampling oracles for p. Their result actually makes no reference to the lopsidependency condition, which 
they prove is implied by the existence of resampling oracles, and can be stated as follows. 

Theorem 1 (Harvey-Vondrak [12]), Let D, F, p, D, p be such that {D, p) regenerate p at flaw fifor every 
i £ [m\. If 9 = p and there exist positive real numbers such that for every i £ [m\, 

SQT{i)j&S 


then a perfect object can be found after polynomially many steps on D. 


In fact, in [12] it was shown that the conclusion of Theorem 1 holds also if (3) is replaced by Shearer’s 
condition [21]. Thus, the work of Harvey and Vondrak [12] marks the end of the road for the derivation and 
analysis of focused stochastic local search algorithms by algorithmizations of the Lovasz Local Lemma. 
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In this work we extend the flaws/aetions algorithmie framework of [1] to arbitrary measures and aetion 
graphs and eonneet it to the Lovasz Loeal Lemma. The result is a theorem that subsumes both Theorem 1 
and all results of [1], establishing a method for designing and analyzing foeused stoehastie loeal seareh 
algorithms that goes far beyond algorithmizing the LLL. 

Coneretely, in [1] we introdueed and analyzed foeused loeal seareh algorithms where: 

• D is atomic, i.e., for every r G and every i G [m] there exists at most one are ct -G- r. 

• jj, is the uniform measure on Q, i.e., ^(•) = 

• p assigns equal probability to every aetion in A{i, a), for every /j G U{a), at every flawed ct G fl. 


Here we generalize to arbitrary D, p,p,, allowing one to trade the sophistieation of the measure p against the 
sparsity of the eausality graph (while removing the need to sample from p, or to regenerate p). Moreover, 
for the speeial ease of uniform p, we improve the eondition of [1] for eonvergenee. We state our results 
formally in the next seetion. 

We also make a eoneeptual eontribution by identifying for eaeh measure p eertain pairs [D, p) as speeial. 
Harmonic Walks. {D, p, p) are harmonie if for every i G [m] and every transition (a, t) G fi x A{i, a), 


Pi{cr,r) 


K't) 

Ea'eA{i,a) 


oc p{t) 


(4) 


In words, when {D,p,p) are harmonie pi assigns to eaeh state in A{i,a) probability proportional to 
its probability under p. It is easy to see that {D,p,p) are harmonie both in the algorithm of Moser and 
Tardos [17] for the variable setting and in the algorithm of Harris and Srinivasan [11] for the uniform 
measure on permutations. There are two reasons why harmonie {D, p, p) eombinations are interesting. 


1.1 Resampling Oracles via Atomic Actions 

If we start with the Probabilistie Method setup, i.e., 11, F, and p, then to get a eonstruetive result by LLL 
algorithmization we must design {D, p) that regenerate p at every flaw fi G F (note that Theorem 1 assumes 
sueh {D, p) as input). While, in general, this ean be a daunting task, we show that if we restriet our attention 
to D that are atomie matters are dramatieally simplified: 

• {D,p, p) must be harmonie, yielding a local eharaeterization of {D, p) at every state. (Theorem 6) 

• The probability of every sequenee of states is characterized by the flaws addressed. (Theorem 5) 

• The initial state ean be arbitrary, i.e., we ean have 6 p. (Theorems 2-4) 

We note that all previous LLL algorithmizations, ineluding [12], require 9 = p. We remove this at a 
mere eost of adding 0(log |11|) to the running time. This is benefieial in settings where sampling from p (a 
global property) is hard, but nonetheless we ean regenerate p at every flaw (a loeal property). 

Atomieity may initially seem artifieial and/or restrietive. In reality, it is a very natural way to promote 
seareh spaee exploration, as it is equivalent to the following: A{i, a) n A{i, a') = 0 for every a a' G fi. 
Moreover, in most settings atomieity ean be aehieved in a straightforward manner. For example, in the 
variable setting the following two eonditions eombined imply atomieity: 
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1. Each flaw forbids exactly owe joint value assignment to its underlying variables, i.e., is a subcube. 

2. Each state transition modifies only the variables of the flaw it addresses. 

Condition 1 expresses a primarily syntactic requirement: compound constraints must be broken down 
to constituent parts akin to satisfiability constraints. In most settings, not only is such a breakdown straight¬ 
forward, but is also advantageous, as it affords a more refined accounting of conflict between constraints. 

Condition 2 on the other hand reflects “focusing”, i.e., that every state transformation should be the 
result of attempting to eradicate some specific flaw fi without interfering with variables not in vbl(/j). 

1.2 Beyond Measure Regeneration 

Designing {D, p) to regenerate p at every flaw can be highly restrictive. This is commonly demonstrated by 
EEE’s inability to establish that a graph with maximum degree A can be colored with g = A + 1 colors, one 
of the oldest and most vexing concerns about the EEE (see the survey of Szegedy [22]). This is because the 
regeneration of the uniform measure implies that to recolor a vertex v we must select uniformly among all 
colors, rather than colors not appearing in v’s neighborhood, inducing a requirement of (7 > eA colors. 

In [1] we introduced the flaws/actions framework to initiate the study of local search algorithms whose 
actions can depend arbitrarily on the state. In the aforementioned example this could mean choosing only 
among colors not appearing in v’s neighborhood, so that as soon as q > A + 1, the causality digraph 
becomes empty and rapid termination follows trivially. In [1], we required D to be atomic and actions to be 
chosen uniformly, a setting that in our current framework can be seen as the special case where p is uniform, 
D is atomic, and {D,p,p) are harmonic. Here we consider general harmonic {D,p,p), i.e., D need not 
be atomic and p can be arbitrary, while {D, p) need not regenerate p. Rather, p is only used as a gauge of 
progress and deviation from regeneration is traded-off against sparsity of the causality graph. 

The reason we focus on harmonic {D, p, p) is that, as we will see, (i) they are optimal with respect to 
the aforementioned trade-off, and (ii) for every D and p there exists p such that {D, p, p) are harmonic. 

2 Statement of Results 

Definition 1. For i G [m] and r G D, let hi = |{(T £ fi : t £ A{i, (t)}|. For i £ [m\, let hi = maXi-gQ ftj". 
Ifbi = I for all i £ \m], then we say that D is atomic. (Note that hi > 0 since A(i, a) / 0/or a £ fi.) 

2.1 Setup 

We establish general conditions under which focused stochastic local search algorithms find flawless objects 
quickly. Recall that any such algorithm performs a random walk on a multi-digraph D which (i) starts at a 
state fji G D selected according to a distribution 9, and which (ii) at each flawed state a first selects some 
fi 3 a to address and then selects r G A(i, a) as the next state, where each r G A(i, a) is selected with 
probability pi{(7,T). As one may expect the flaw-choice mechanism does have a bearing on the running 
time of such algorithms and we discuss this point in Section 2.6. Our results focus on conditions for rapid 
termination that do not require sophisticated flaw choice (but can be used in conjunction which such choice). 

To establish a walk’s capacity to rid the state of flaws we introduce a measure p on Q. Without loss 
of generality, and to avoid certain trivialities, we assume that p(a) > 0 for all a G O. The choice of p is 
entirely ours and can be trivial, i.e., p{-) = |D|“^. Typically, p assigns only exponentially small probability 
to flawless objects, yet allows us to prove that the walk reaches such an object in polynomial expected time. 
Its role is to define a “charge” 7 * = yi{D, 6, p, p) for each flaw / G F, ideally as small as possible. 
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2.2 Definition of Flaw Charges 

Regenerative case. If {D,p) regenerate p at every fi £ F and either 6 = por D is atomie, then 


General case. Otherwise, 7 * 


7* = 


bi max^g/, Af, where 


Af = max <'/Ji(cj,r) 

tEA(2,(j) 


J 


( 5 ) 


We will diseuss several aspeets of the definition of 7 * in Seetion 3. The main point is that the notion 
of eharge allows us to state our results without having to distinguish between the regenerative and the 
general ease, by simply substituting the appropriate eharge. This also serves to highlight that the standard 
(Probabilistie Method) formulation of the LLL (and its algorithmizations) is, in faet, only a faeet of a far 
more general pieture, for whieh our results provide the first analytieal tools. In the regenerative ease, sinee 
li = hifi)’ our eonditions will parallel those of the LLL (and its algorithmizations). In the general (non- 
regenerative) ease, we will have 7 * > /u(/j) always, but a potentially far sparser eausality graph. To gain 
some first intuition for 7 * as a notion of eongestion in the general ease observe that if p is uniform and D is 
atomie, then 7 * is simply the greatest transition probability pi{a, r) on any are originating in /*. In general, 
it is the ergodie flow from a fo r divided by fhe eapaeify, p{t), of r (and sealed by hi). 

To slate our resulls we need a Iasi definilion regarding fhe dislribulion 9 of fhe initial slate. 

Definition 2, The span of a probability distribution 0 : 17 —)• [0,1], denoted by S{6), is the set of flaws that 
may be present in a state selected according to 6, i.e., S{9) = Uo-Gr 2 e(o -)>0 ^(^)- 


2.3 A Simple Markov Chain 

Our firsl resull eoneerns fhe simplesl ease where, afler ehoosing an arbilrary permufalion vr of fhe flaws, fhe 
algorilhm in eaeh flawed slate a simply addresses fhe grealesl flaw presenl in a aeeording lo vr. Observe 
lhal subsliluling 7 * = p{fi) for fhe regeneralive ease lo Theorem 2 reeovers fhe eondilion of Theorem 1. 

Theorem 2. If there exist positive real numbers {fi} such that for every i £ [m], 

X E n "^7 < 1 

scrii)j&s 

then for every vr the walk reaches a sink within (Tq + s)/(5 steps with probability at least 1 — 2“^, where 
(5 = 1 — maxjgj^] C,i > 0, and 


rr. . ( K<y) 

Tq = log2 max —— 
\a€Q, p[aj 


+ log2 


scsie)jes ) 


Theorem 2 has Iwo fealures worlh diseussing, bolh direels eonsequenees of fhe generality of our frame¬ 
work, i.e., of abandoning fhe Probabilistie Melhod viewpoinl and measure regeneration. 

Arbitrary initial state. Sinee 9 ean be arbitrary in the general ease, any foothold on O suffiees to apply the 
theorem, without needing to sample from O aeeording to some measure. This ean also be interesting when 
we ean not sample from p, but ean regenerate p at every flaw on an atomie D, i.e., the seeond ease of the 
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regenerative setting. Note also that Tq eaptures the trade-off between the faet that when 0 = fx the first term 
in To vanishes, but the seeond term grows to refleet the uneertainty of the set of flaws present in a\. 
Arbitrary number of flaws. The running time depends only on the span |5(0)|, not the total number of 
flaws |F|. This has an implieation analogous to the result of Hauepler, Saha, and Srinivasan [10] on eore 
events: even when |F| is very large, e.g., super-polynomial in the problem’s eneoding length, we ean still 
get an effieient algorithm if, for example, we ean find a state cji sueh that |f7(cJi)| is small, e.g., by proving 
that in every state only polynomially many flaws may be present, or 0 sueh that |5(0)| is small. 

2.4 A Non-Markovian Algorithm 

Our next results eoneerns the eommon setting where the subgraph indueed by the neighborhood of eaeh 
flaw in the eausality graph eontains several ares. We improve Theorem 2 in sueh settings by employing a 
recursive algorithm. The flaw addressed in eaeh step thus depends on the entire trajeetory up that point not 
just the eurrent state, i.e., the walk is non-Markovian. It is for this reason that we required a non-empty set 
of aetions for every flaw present in a state, and why the definition of the eausality digraph does not involve 
flaw ehoiee. The improvement is that rather than summing over all subsets of r(i) as in (6), we now only 
sum over independent sueh subsets, where /*, fj are dependent if fi —)■ fj and fj —)■ /j. This improvement 
is similar to the eluster expansion improvement of Bissaeot et al. [3] of the general LLL. As a matter of faet. 
Theorem 3 implies the algorithmie aspeets of [3] (see [20] and [12]). 

The use of a reeursive algorithm affords an additional advantage, as it enables “responsibility shifting” 
between flaws. Speeifieally, for a fixed aetion digraph D with eausality digraph C, the reeursive algorithm 
(and Theorem 3), take as input any digraph C, i.e., allow for ares to be added to the eausality digraph. 
The reason for this as follows. While adding, say, ares fi —)■ fj and fj —)■ /* may make the sums eorre- 
sponding to fi and fj greater, if is sueh that {fi, fj} C r(/c), then its sum may beeome smaller, as fi, fj 
are now dependent. As a result, sueh are addition may enable a suffieient eondition for rapid eonvergenee 
to a perfeet objeet, e.g., in our applieation on Aeyelie Edge Coloring in Seetion 6. An analogous eounter- 
intuitive phenomenon is also true in the improvement of Bissaeot et al. [3] where denser dependeney graphs 
may result to a better analysis. 

Below, for 5 C F, we let R{S) denote the greatest element of S aeeording to tt. For any fixed ordering 
TT of F the reeursive walk is the non-Markovian random walk on Q that oeeurs by invoking proeedure 
Eliminate. Observe that if in line 8 we do not interseet U(a) with r^(/j) the reeursion is trivialized and 
we reeover the simple walk of Theorem 2. 


Recursive Walk 

I 

procedure Eliminate 


2 

a ^ e{-) 

t> Sample a from 6 

3 

while U{a) $ do 


4 

Address {RiU{a)),a) 


5 

return a 


6 

procedure Address(z, a) 


7 

a T ^ A{i, a) with probability pi{a, r) 


8 

while B = U{a) n Tnifi) / 0 do 

oNote nrR{fi) 

9 

ADDRESS(/,r(-B),0-) 
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Definition 3. Given a digraph Ron F let G = G{R) = (F, E) be the undirected graph where {/, g} £ E 
iff both f ^ g and g ^ f exist in R. For S C F, let Ind(S') = {5' C S : S' is an independent set in G}. 

Theorem 3. Let RD G be arbitrary. If there exist positive real numbers {V’i} such that for every i E [m], 

seind(rfl(i))iGS 

then for every tt the recursive walk reaches a sink within (Tq + s)/5 steps with probability at least 1 — 2~^, 
where 5 = 1 — maxjg[^] Qi > 0, and 


Tq = log. 


( 


e{a) 

max —^ 


+ I 

. SCInd(5(6»)) ies 


Remark 2. Theorem 3 strictly improves Theorem 2 since by taking R = G (i) the summation in (7) is 
only over the subsets ofTji{f) that are independent in G, instead of all subsets ofTji{f) as in (6), and (ii) 
similarly for Tq, the summation is only over the independent subsets ofS{9), rather than all subsets ofS{9). 


Remark 3. Theorem 3 can be strengthened by introducing for each flaw f £ F a permutation ttj 
and replacing n with ttj in line 9 the of Recursive Walk. With this change in (7) it suffices to sum only 
over S C Tfi{f) satisfying the following: if the subgraph of R induced by S contains an arc g ^ h, then 
t^fig) > As such a subgraph can not contain both g ^ h and h ^ gwe see that S £ lnd(TR{f)). 


2.5 A General Theorem 

Theorems 2 and 3 are instantiations of a general theorem we develop for establishing the success of focused 
local search algorithms by local considerations. Before presenting the theorem itself, we first briefly discuss 
its derivation, as that helps motivate and digest the theorem’s form. 

To bound the probability of not reaching a sink within t steps we partition the set of all f-trajectories 
into equivalence classes, bound the total probability of each class, and sum the bounds for the different 
classes. The partition is according to the t-sequence of flaws addressed, which acts as a statistic of the state 
distribution. Formally, for a trajectory T, = ai • • • we let PF(S) = w\,W 2 - ■ ■ denote its 

witness sequence, i.e., the sequence of flaws addressed along S. We let TFt(S) =_L if S has fewer than 
t steps, otherwise we let ITt(S) be the f-prefix of PF(S). Slightly abusing notation we let Wt = fFr(S) 
be the random variable when S is the trajectory of the walk, i.e., selected according to (77, p, 9) and the 
flaw-choice mechanism. If Wt = Wt (A) denotes the range of Wt for an algorithm A, then the probability 
that A takes t or more steps, trivially, is YlweWt Tv\Wt = W]. 

Key to our analysis is deriving upper bounds for Pr[lFt = W] that factorize over the elements of W. 
Specifically, for an arbitrary sequence of flaws A = oi,..., at, let us denote by [t] the index j £ [m] such 
that at = fj. Lemma 1 holds for both the regenerative and the general case, with the corresponding yt. 
Moreover, we will see that it can be tight, up to the prefactor 

Lemma 1. Let ^ = C{9,p) = maXo-gn{0((T)/^(cr)}. For every sequence of flaws A = ai,... ,at, 

t 

Pr[lLt = A] <Cn7[q ■ 

i=l 
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The product form of the bound in Lemma 1 allows us to combine it with different collections of forests, 
each collection expressing an upper bound for (superset of) the set of all possible witness sequences Wt- 
The formulation of the supersets as forests enables the combinatorial enumeration of their elements (flaw 
sequences) which, combined with Lemma 1, yields Theorem 4 below. While Wt depends on flaw-choice, the 
main and common feature of all bounds (forests) is the enforcement of the following idea: while the very first 
occurrence of a flaw fj in a witness sequence W may be attributed to fj 3 ai, every subsequent occurrence 
of fj must be preceded by a distinct earlier occurrence of a flaw fi that can “assume responsibility” for 
fj, e.g., a flaw fi that potentially causes fj. In this way, the set Wt is bounded syntactically by differently 
sophisticated considerations of flaw-choice and responsibility. Specifically, Definition 4 below (i) imposes 
a modicum of control over flaw-choice, while (ii) generalizing the subsets of flaws for which a flaw / may 
be responsible from subsets of r(/) to arbitrary subsets of flaws, thus enabling responsibility shifting. 

Definition 4. Given [D, p, 0), a flaw-choice mechanism is traceable if there exist sets Roots(0) C 2^ and 
List(/i) C 2^, ..., List(/m) C 2^ such that for every t > 1, the set of all possible witness sequences Wt 
can be injected into unordered rooted forests with t vertices that have the following properties: 

1. Each vertex of the forest is labeled by a flaw fi G F. 

2. The flaws labeling the roots of the forest are distinct and form an element o/Roots(0). 

3. The flaws labeling the children of each vertex are distinct. 

4. If a vertex is labelled by flaw fi, then the labels of its children form an element o/List(/j). 

To recover the witness sequence from a forest, thus demonstrating the injection of Wt, we make use of 
the specificity of the mechanism for selecting which flaw to address at each step. For example, the forests 
that correspond to the algorithm of Theorem 3 are “recursion forests”, having one node for each recursive 
call of Address, labelled by the flaw that is the call’s argument. To recover the sequence of addressed 
flaws, we order the trees in the forest and the progeny of each vertex using knowledge of tt and then traverse 
each tree in the recursive forest in postorder. We explain why the algorithms of Theorems 2 and 3 are 
traceable in Appendix A, where we describe the set of witness forests that correspond to each theorem. 

Theorem 4 below implies both Theorem 2 and Theorem 3. While those two theorems do not care about 
the flaw ordering tt. Theorem 4 also captures the “LeftHanded Random Walk” result of [1] (motivated by 
the LeftHanded version of the LLL introduced by Pedgen [19]), under which the flaw order tt can be chosen 
in a provably beneficial way, yielding a “responsibility” digraph. That is, both in the regenerative case and 
in the general case, one can use our 7 * as the charge for each event in the responsibility digraph of [ 1 ]. 

Theorem 4 (Main result). If A results by applying a traceable flaw-choice mechanism on {D,p,6) and 
there exist positive real numbers {iji} such that for every flaw fi G F, 

SGList(/ilies' 

then a sink is reached within (Tq -|- s)/(5 steps with probability at least 1 — 2~^, where (5 = 1 — max^j and 

iG[m] 


^ . ( G{a) 

To = log2 max 

\(7GD 


+ log2 


E ) 

5'€Roots(0) jES" J 
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2.5,1 Proofs of Theorems 2 and 3 from Theorem 4 

In Appendix A, we deseribe the Break Forests and Recursive Forests, into whieh we injeet the witness 
sequenees of the algorithms of Theorems 2 and 3, respeetively. Theorem 2 follows from Theorem 4 as 
Break Forests satisfy the eonditions of Theorem 4 with Roots(0) = and List(/) = Theorem 3 

follows as Reeursive Forests satisfy the eonditions with Roots(0) = Ind(5(0)) and List(/) = Ind(r/j(/)). 


2.6 A Sharp Analysis and the Role of Flaw Choice 

In Seetion 4.3 we prove that Lemma 1 is tight for a rather large elass of algorithms, ineluding the algorithm 
of Moser-Tardos [17] when eaeh flaw fixes the values of its variables, as in SAT, and the algorithm of Harris 
and Srinivasan for permutations [11]. 


Theorem 5. Let /3 = mino-gn f('^) > 0- ^ (D, p) regenerate p at every flaw and D is atomic, then for 
every W = wi,W 2 , ■ ■ ■ ,wt G Wt, 


Fr[Wt = IR] 
ULi dim) 


</ 3 -' ■ 


(9) 


Equation (9) tell us that an algorithm will eonverge to a perfeet objeet in polynomial time if and only if 
the sum Ylw&Mt 0^=1 converges to a number less than 1 as f grows. In that sense, the quality of the 
algorithm’s analysis depends solely on how well we approximate the set of possible witness sequenees Wt- 
The set Wt is elearly a funetion of how we ehoose whieh flaw to address in eaeh step and therefore 
algorithmie performanee elearly depends on the flaw ehoiee meehanism (even more so, in this “tight” ease). 
However, in the Moser-Tardos analysis, as well as in the work of Harris and Srinivasan on permutations [11], 
the flaw ehoiee meehanism “is swept under the rug” [22] and is allowed to be arbitrary. This ean be explained 
as follows. In those two settings, due to the symmetry of Ll, we ean afford to approximate Wt in a way that 
eompletely ignores flaw ehoiee, i.e., eonsidering it adversarial, and still reeover the ELL eondition. In 
a very reeent paper [14], Kolmogorov gives a more general symmetry eondition under whieh the results 
of [1] for the flaws/aetions framework hold with arbitrary flaw ehoiee. However, sueh symmetries ean not 
be expeeted to hold in general settings, something refleeted in Theorems 2 and 3 in the speeifieity of the 
flaw-ehoiee meehanism, while in Theorem 4 it is refleeted in the requirement of traeeability. 


2.7 Applications: Incorporating Global Conditions 

To demonstrate the power of our framework we derive a novel bound for aeyelie edge eolorings, aimed at 
graphs of bounded degeneraey, a elass ineluding graphs of bounded treewidth. To get the result we heavily 
use the faet that we do not have to regenerate a measure (and so the result eannot be eaptured by the EEE). 
Unlike reeent work on the problem [7, 9] that also goes beyond the EEE, our result is established without 
any problem speeifie elements, but rather as a direet applieation of Theorem 3. 


3 Charging Flaws 

In Seetion 2.1 we defined how to assign to eaeh flaw a eharge yi, depending on whether (79, p) regenerate 
p or not. We also stated that in the non-regenerative ease yt > p{fi) always. Thus, ideally, we would 
like to use a sophistieated measure p that assigns minimal probability mass to the flawed states and, at the 
same time, have (79, p, p) that regenerate p at every flaw. In reality, the more sophistieated p is the harder 
regeneration beeomes. Therefore, realistieally, we ean either employ (79, p, p) that regenerate p at every 
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flaw and get charges as small as possible, but for unsophisticated measures (like product measures), or we 
can forgo regeneration, pay the price 7 * > /i(/i) to reflect the distortion of by {D, p), and use more 
sophisticated measures. Crucially, making the latter choice typically also means that we can get a sparser 
causality graph by exploiting the flexibility afforded by not having to design D so as to regenerate p, as in 
the non-regenerative case D can be arbitrary. 

Lemma 2. 7 ^ 


Proof By the definition of 7 *, in the regenerative case we have equality, while in the non-regenerative case. 


li 


< 


E 

o-G/i 




E E E 

cG/i TGA(j,(7) * TeA(i,lj) 


( 10 ) 


where for the last inequality we used that X^cre/i a) enumerates every r G at most hi times. □ 

Observe that for any pair {D,p) taking p so that {D,p,p) are harmonic, i.e., taking pi{a,T) oc /i(r), 
minimizes for all a £ fi simultaneously. This optimality is the main reason that motivates harmonic 
algorithms. The other reason is the realization that designing {D, p) that regenerate p at every flaw is often 
achieved by {D, p, p) being harmonic. As matter of fact, as we state in Theorem 6 below, if D is atomic 
then (D, p, p) being harmonic is necessary for regeneration, a fact that also yields a characterization of the 
local structure of atomic digraphs regenerating a measure. 

As a final remark, we note that all results that correspond to “algorithmizations of the LLL” correspond 
to the (very) special case where {D, p) regenerate p at every /* G F. 


3.1 The atomic case 


Atomic digraphs capture algorithms that appear in several settings, e.g., the Moser-Tardos algorithm [17] 
when constraints are in CNF form, the algorithm of Harris and Srinivasan for permutations [11], and others 
(see [1]). Theorem 6 below asserts that (77, p, p) being harmonic is a necessary condition for regeneration 
when D is atomic. Observe that (77, p, p) being harmonic means that we need not be concerned with the 
design of p as it is implied by (77, p). As for 77 itself, the theorem implies that to build A(i, a) we must 
“collect” arcs that satisfy (11) (while, presumably, keeping the causality graph as sparse as possible). These 
two facts offer guidance in designing 77 so that (77, p) regenerate p at every flaw in atomic digraphs. 


Theorem 6. If D is atomic and (77, p) regenerate p at every flaw, then (77, p, p) are harmonic. Moreover, 
for every i G [m] and every a G fi. 


T€A(i,(T) 


p{(^) 

p{fi) 


( 11 ) 


Proof. If 77 is atomic, p > 0, and (77, p) regenerate p at every flaw f, it follows that for every r G O there 
is exactly one a £ fi such that pj(cT, r) > 0. (And also that IJo-ep ~ ^)- Therefore, regeneration at 

fi in this setting is equivalent to 


for every a £ fi and the unique r G A{i, a): 


Pi{a,T) = p{t) 


P{fi) 

p{a) 


( 12 ) 


(Note that for given 77, p there may be no p satisfying (12), as we also need that YItgAP a) '^) = 1-) 
Since Pi{a, r) oc p(r) in (12) we see that p is harmonic. Summing (12) over r G A{i, a) yields (11). □ 


11 



3.2 Improved charges for the uniform measure case 

As mentioned, the framework of [1] amounts to the ease where D is atomie, ^ is uniform, and {D, p, p) is 
harmonie, so that in every step a uniformly random element of A{i, a) is seleeted. When p is uniform, we 
prove in Seetion 4.2 that 7 * is the inverse of min^gy. of, where af = \A{i, cj)| and, thus, our Theorems 2 
and 3 reeover perfeetly the results of [1]. 

To deal with the ease where D is not naturally atomie, e.g., when a flaw oeeurs under more than one 
value assignments to its variables, one ean proeeed to refine eaeh fi £ F into bi = max^gj. flaws so that 
D becomes atomic. In subsection 4.4, using the machinery developed for the general case in Section 4.2, 
we remove the need to “atomize” D and derive bounds dominating those that come from atomization. To 
achieve this we modify the walk so that each element r G A{i, a) is selected with probability proportional 
to the inverse of its in-degree, i.e., pi{a, r) = ( 6 [ o-) 1/^f Doing this, we get that the charge 

we should assign to each flaw /* (assuming we are in the general case with uniform measure) is: 


4*i 


max — < — , 

{a,T)eDi a£ Qi 


where the right hand side above is the charge on flaw /j that would be assigned by atomization, potentially 
much greater than our bound max(o- .^)g£). bj/af. 


4 Proof of Lemma 1 

In Section 4.1 we give the proof for the regenerative case when the measure p is sampleable (while {D, p, p) 
are not necessarily harmonic). The proof for that case mimics that of [9] and [12]. In Section 4.2 we show the 
proof for the general (non-regenerative) case. The proof for the regenerative case when p is not sampleable 
but D is atomic, is given as a special case of that proof in Section 4.3. Finally, in Section 4.4 we show how 
to get the improved bounds for the general case with uniform measure, described in Section 3.2. 

4.1 The Regenerative Case with Sampleable p 

We start by proving Lemma 1 for the case where 9 = p and {D, p) regenerate p at every flaw. 

Lemma 3. If 9 = p and {D, p) regenerate p at every fi G F, then for every sequence W = wi,... ,wt, 

t 

PT[Wt = W]<llp{wi) . 

i=l 

Proof. To bound Pr[lTt = IL] we will drop the requirement that flaw Wi is selected by the flaw-choice 
mechanism at ai and only require that at £ Wi. To bound this latter probability we introduce a randomized 
process C which given as input an arbitrary sequence of flaws ai, 02 ,..., at proceeds as follows: 

• Select a state ai according to 9 and set fai 1(0) = 0 

• For i from 1 to f do: 

- If ai ^ ai, then set fail(i) = 1 else set fail(z) = fail(i — 1) 

- Address a* at ai according to {D, p), i.e., set cTj+i = r G A{i, a) with probability pi{ai,T) 
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Observe that the sequence fail(i) is non-decreasing. By coupling C with the algorithm we readily get 


t 


Pr \Wt = wi,... ,Wt] < Pr 

c 


Pi cJi G rui 


For t > 0, let P{t) be the proposition: for every A = ai,..., at and every r G O, on input A 


(13) 


Pr [fjt+i = r|fail(t) = 0] = ^(r) . (14) 

c 

We will prove that P{t) holds for all f > 0 by induction on t. Along (13) this readily implies the lemma. 

P(0) follows from 9 = ^. Assume that P{t) holds for all t < s and consider any sequence oi,..., a^. 
If fail(s) = 0 then fail(s — 1) = 0 as well which, by P{s — 1), implies that cr^ is distributed according to fi. 
Moreover, ag G and, therefore, ag+i is selected by addressing at ag- Therefore for every state r G 0, 

Pr [o-^+i = r|fail(s) = 0] = ^ (a, r) = ^(r) , 

where the second equality holds because {D, p) regenerate p every flaw in F. □ 


4.2 The General Case 


Since the flaw addressed in each step depends only on the trajectory up to that point and not on any future 
randomness, the probability of any specific f-trajectory S = ci a 2 • • • (Tt fJt+i is 

t 

P[q ? (1^) 

i=l 


where recall that [i] = j such that wt = fj. To bound Pr[lPt = W] we will sum (15) over all f-trajectories 
with witness sequence W. 

Recall that bj = |{it £ ft : t £ A{i,(7)}\. For each pair {W,at+i), where at+i £ we construct 
an edge-weighted tree as follows. The root of the tree is at+i- Let Pt = {a : at+i £ A([f], cr)}, i.e., Pt 
contains the states in D with an arc to at+i labelled f^t] which, thus, are the possible states immediately 
prior to at+i in any trajectory with witness sequence W. The progeny of the root consists of a child for 
each at £ Pt, each parent-child edge weighted by P[t](o't, o't+i)- Each child vertex acquires progeny in the 
same manner, i.e., it has one child per possible immediately prior state, the corresponding edge annotated 
by /9[i_i](cJt_i, (Jt). And so on, until W is exhausted. The constructed tree has the following properties: 

• Its root-to-leaf paths are in 1-1 correspondence with the trajectories compatible with (VF, at+i). 

• The product of the numbers along each root-to-leaf path (trajectory) equals nl =1 P[i]{cri: o-j-tl)' 

Thus, to compute the probability of all trajectories with witness sequence W and final state at+i it suffices 
to sum, over all root-to-leaf paths, the product of the probability of each path’s leaf vertex under 9 with the 
product of the weights along the path’s edges. 

To bound this sum for a measure p we define for every i £ [m]. 


(f) = max pi{a, r)-^ 

TeA(t,(7) p[T) 


(16) 


13 



Observe that, by definition, for every (a, r) such that r G A{i, a) we have bj > 1 and, therefore. 




g Kr) 

bl n{a) 


(17) 


Therefore, we can bound the probability of any f-trajectory (root-to-leaf path) S = cji,..., cJt+i by 




i=l 


i=l 






t 

K(^t+i)Yl 

i=l 



(18) 


If in the tree we now replace the weight pjj] (cjj, cjj+i) of every edge by the weight of every 

leaf by where ^ = maX(jgQ{0(cj)//i(cj)}, we see by (18) that the aforementioned sum over all 

root-to-leaf paths will give an upper bound on the total probability of trajectories with witness sequence W 
and final sfafe at+i- A key fhing fo observe is fhaf after fhis edge-weighf replacemenf, any vertex of fhe free, 
say one corresponding fo a sfafe r, will have some number bj^ children, while each of ifs parenf-child edges 
will have weighl for some sfafe a. This facf pufs as in a posifion fo perform fhe summation. 

For each i G [m], let (/>* = max^gj. . Consider any vertex v of the tree whose children are leaves and 
let r be n’s state. Replacing the children of v with a single leaf child, connected to v by an edge of weight 
(/>[!], can only increase the contribution to the sum of the subtree rooted at v since ''[.1 X = fl S 

Proceeding to collapse the progeny of all other vertices in the same level of the tree as v and then moving 
on to the next level, etc. collapses the entire tree to a single path whose product of edge-weights is 0^=1 ^[i] 
and whose leaf was weight implying 


t t 

Pr[lCt = 1C] < ^ CF(^t+i) n ^ n <^[*1 • 
+ 2 = 1 2=1 


To conclude the proof for the general case observe that for every i G [m] 


(pi = max^f = max< max pi{a,T)—p^ 

crefi o-G/i ( T£A(i,a) P\j) 


^ hr i \ Pi(<7,r) 

< maxo,- max < n(cj) max --- 

tGH o-G/i ( T&A{i,u) p[T) 


= li 


(19) 


4.3 The Atomic Case and Proof of Theorem 5 


If D is atomic and (79, p) regenerate p at every /j. Theorem 6 implies that (79, p, p) are harmonic and thus 


= Af = max 

T£A{i,a) 


Pi{cr, t] 


f(o') 

Pi'r) 


= max 


p{a) 




(j'G A( 2 ,(j) 




= Pifi) 


( 20 ) 


where the last equality follows from (11). This establishes the regenerative case of Lemma 1 for atomic 79. 

To prove Theorem 5 we note that Lemma 1, valid for any (79, p, p, 6 ), readily yields the upper bound. 
For the lower bound we observe that in order for W G Wt there must exist at least one trajectory S* such 
that lLt(S*) = W. Since, by (20), we have = pi{a, T)p{a)/ p{t) we can conclude that 


Pr[FFi = 1C] > Pr [S = S*] = 9{al)\{pyi^{c 


>i+i) 


n 


p{^i 




2=1 


2=1 


P{^t 


t 

P{(^t+i) 

2 = 1 
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4.4 The Special Case of the Uniform Measure 

To get the improved bounds for the uniform measure reeall that a, = ming-g/. af = \A{i, cj)|. If 

Di is the subgraph of D eomprising the ares labeled by fi, then (19) yields 


(/)j = max= max max pi{a,T)b'l 

u&fi aefi T&A{i,a-) 


bj ^ h 

max — < — . 

{cr,T)eDi Oj 


5 Proof of Theorem 4 

Per the hypothesis of Theorem 4, eaeh bad f-trajeetory on D is assoeiated with a rooted labeled witness 
forest with t vertiees sueh that given the forest we ean reeonstruet the sequenee of flaws addressed along 
the f-trajeetory. Reeall that neither the trees, nor the nodes inside eaeh tree in the witness forest are ordered. 
To prove Theorem 4 we will give Tq sueh that the probability that a (Tq + s)-trajeetory on D is bad is 
exponentially small in s. Let Wt 5 Wt be the set of witness sequenees of size t that eorrespond to these 
forests (sometimes, when it is elear from the eontext, we abuse the notation and use Wt to denote the set of 
of witness forests themselves). Per our diseussion above (see Lemma 1) to prove the theorem it suffiees to 
prove that max^gn Yl,w€Wt ni=i 7 [*] exponentially small in s for f = Tq + s. 

To faeilitate eounting we fix an arbitrary ordering vr of F and map eaeh witness forest into the unique 
ordered forest that results by ordering the trees in the forest aeeording to the labels of their roots and similarly 
ordering the progeny of eaeh vertex aeeording to vr (reeall that both the flaws labeling the roots and the flaws 
labeling the ehildren of eaeh vertex are distinet). 

Having indueed this ordering for the purpose of eounting, we will eneode eaeh witness forest as a rooted, 
ordered d-ary forest T with exaetly t nodes, where d = maxyg^? |List(/)|. In a rooted, ordered d-ary forest 
both the roots and the at most d ehildren of eaeh vertex are ordered. We think of the root of T as having 
reserved for eaeh flaw / G Roots(d) a slot. If / G Roots(d) is the i-th largest flaw in F aeeording to ip 
then we fill the i-th slot (reeall that the flaws labeling the roots of the witness forest are distinet and that, as 
a set, belong in the set Roots(d)). 

Each node n of T corresponds to a node of the witness forest and therefore to a flaw / that was addressed 
at some point in the f-trajectory of the algorithm. Recall now that each node in the witness forest that is 
labelled by a flaw / has children labelled by distinct flaws in List(/). We thus think of each node v of T 
as having precisely |List(/)| slots reserved for each flaw g G List(/) (and, thus, at most d reserved slots 
in total). For each g G List(/) we fill the slot reserved for g and make it a child of v in T. Thus, from T 
we can reconstruct the sequence of flaws addressed with the algorithm. To proceed, we use ideas from [20]. 
Specifically, we introduce a branching process that produces only ordered d-ary forests that correspond to 
witness forests and bound ^i=i O'!*] analyzing it. 

Given any real numbers 0 < < oo we define Xi = and write Roots(d) = Roots to simplify 

notation. To start the process we produce the roots of the labeled forest by rejection sampling as follows: For 
each flaw g £ F independently, with probability Xg we add a root with label g. If the resulting set of roots 
is in Roots we accept the birth. If not, we delete the roots created and try again. In each subsequent round 
we follow a very similar procedure. Specifically, at each step, each node u with label £ “gives birth”, again, 
by rejection sampling: For each flaw g G List(£) independently, with probability Xg we add a vertex with 
label gr as a child of u. If the resulting set of children of u is in List((') we accept the birth. If not, we delete 
the children created and try again. It is not hard to see that this process creates every possible witness forest 
with positive probability. Specifically, for a vertex labeled by I, every set S 0 List(£) receives probability 
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0, while every set S G List(£) reeeives probability proportional to 

'We{S) = (l-Xh) ■ 

g&S /ie(List (£))\5 

To express the exact probability received by each S G List(£) we define 


QiS) := 


nQG5(i 


g£S 


and let Zi = n/G(List(£)) “ ^/)- claim that W£{S) = Q{S) Z^. To see the claim observe that 

W£{S) _ n/iG(List(£))\s (1 “ _ Wges^g _ 


= Q{S) . 


n/eList(£)(l ®/) nges(l ^g) 

Therefore, each S G List(£) receives probability equal to 

W£{S) _ Q{S)Z£ _ Q(S) 

Z]BeList(£) ZlseListfi) ZlseListfi) Qi^) 

Similarly, each set R G Roots receives probability equal to Q{R) (X^^gRoots Qi^)) 
Lemma 4. The branching process described above produces every tree (j) G Wt with probability 


E n 

SGKootsiGS / v^(j) 


J2s&List{v) Qi^) 


Proof. For each node v of f let N{v) denote the set of labels of its children. By (22), 


Q{R) 


Q{N{v)) 


SsGRoots Q{S) Z]seList(D) Q{R) 
Q{R) Y[v£<f>\R'^'v 

^SeRootsQiR) ni,e?i X]seList(i;) 


E ««) n 

S'ERoots / vGfp 


Z]seList(D) QiR) 


Notice now that 


E = E n 


weWt'‘ 


weWt 


rr 

Z]seList([i]) QiR) 


^ E n 


W&Wt^ 


^S&List{[{\) QiR) 


= (maxCij E E 


W&Wt 


SGRoots 


= max( 
V i£F 


Q E 


5 €Roots 
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Using (23) we see that the binary logarithm of the probability that the walk does not eneounter a flawless 
state within t steps is at most t log 2 (maxigi? Q) + Tq, where 


To 


log2 max 


9{a) 


\creo /r(cj) 


+ log2 



Therefore, if f = (Tq + s)/log 2 (l/maxjgi? Q) < (^o + s)/S, the probability that the random walk on 
D does not reaeh a flawless state within t steps is at most 2“^. 


6 Application to Acyclic Edge Coloring 

6.1 Earlier Works and Statement of Result 

An edge-eoloring of a graph is proper if all edges ineident to each vertex have distinct colors. A proper edge 
coloring is acyclic if it has no bichromatic cycles, i.e., no cycle receives exactly two (alternating) colors. 
Acyclic Edge Coloring (AEC), was originally motivated by the work of Coleman et al. [5, 4] on the efficient 
computation of Hessians. The smallest number of colors, Xa(G*)’ for which a graph G has an acyclic edge¬ 
coloring can also be used to bound other parameters, such as the oriented chromatic number [15] and the star 
chromatic number [8], both of which have many practical applications. The first general linear upper bound 
for x'a was given by Alon et al. [2] who proved Xai^l) < 64A(G), where A(G) denotes the maximum 
degree of G. This bound was improved to 16A by Molloy and Reed [16] and then to 9.62(A — 1) by Ndreca 
et al. [18]. Attention to the problem was recently renewed due to the work of Esperet and Parreau [7] who 
proved Xai^l) < 4(A — 1), via an entropy compression argument, a technique that goes beyond what the 
EEE can give for the problem. Very recently, Giotis et al. improved the result of [7] to 3.74A. 

We give a bound of (2 + o(l)) A for graphs of bounded degeneracy. This not only covers a significant 
class of graphs, but demonstrates that our method can incorporate global graph properties. Recall that a 
graph G is d-degenate if its vertices can be ordered so that every vertex has at most d neighbors greater than 
itself. If Gd denotes the set of all d-degenerate graphs, then all planar graphs are in G 5 , while all graphs with 
treewidth or pathwidth at most d are in Gd- We prove the following. 

Theorem 7. Every d-degenerate graph of maximum degree A has an acyclic edge coloring with [(2 + e) A] 
colors than can be found in polynomial time, where e = 16y^ d/A. 

6.2 Background 

As will become clear shortly, the main difficulty in AEC comes from the short cycles of G, with 4-cycles 
being the toughest. This motivates the following definition. 

Definition 5. Given a graph G = {V,E) and a, perhaps partial, edge-coloring of G, say that color c is 
4-forbidden for e £ E if assigning c to e would result in either a violation of proper-edge-coloration, or in 
a bichromatic 4-cycle containing e. Say that c is 4-available if it is not 4-forbidden. 

Similarly to [7, 9] we will use the following observation that the authors of [7] attribute to Jakub Kozik. 

Lemma 5 ([7]). In any proper edge-coloring of G at most 2(A — 1) colors are 4-forbidden for any e G E. 

Proof The 4-forbidden colors for e = {n, n} can be enumerated as: (i) the colors on edges adjacent to u, 
and (ii) for each edge e„ adjacent to v, either the color of e„ (if no edge with that color is adjacent to u), or 
the color of some edge e' which together with e, and an edge adjacent to u form a cycle of length 4. □ 
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Armed with Lemma 5, the general idea is to use a palette P of size 2{A — 1) + Q colors so that whenever 
we wish to (re)color an edge e there will be at least Q colors 4-available for e (of course, assigning such 
a color to e may cause one or more cycles of length at least 6 to become bichromatic). At a high level, 
similarly to [9], our algorithm will be: 

• Start at a proper edge-coloring with no bichromatic 4-cycles. 

• While bichromatic cycles of length at least 6 exist, recolor the edges of one with 4-available colors. 

Note that to find bichromatic cycles in a properly edge-colored graph we can just consider each of the (1^^) 
pairs of distinct colors from P and seek cycles in the subgraph of the correspondingly colored edges. 


6.3 Applying our Framework 

Given G = {V, E) and a palette P of 2(A — 1) + Q colors, let 17 be the set of all proper edge-colorings of 
G with no monochromatic 4-cycle. Fix an arbitrary ordering -k of E and an arbitrary ordering x of P. For 
every even cycle C of length at least 6 in G fix (arbitrarily) two adjacent edges e^' of C. 

- Our distribution of initial state 6 assigns all its probability mass to the following cji G 17: color the 
edges of E in vr-order, assigning to each edge e £ E the x-greatest 4-available color. 

- For every even cycle G of length at least 6 we define fhe flaw /c = {a G 17 : C is bichromatic}. 
Thus, a flawless a G 17 is an acyclic edge coloring of G. 

- The set of actions for addressing fc in state a, i.e., A{C, a), comprises all r G 17 that may result from 
the following procedure: uncolor all edges of G except for e^; go around C, starting with the uncolored 
edge that is adjacent to e^, etc., assigning to each uncolored edge e £ C one of the 4-available colors for e 
at the time e is considered. Thus, by lemma 5, |A(C, cj)| > gi^i-2. 

Lemma 6. For every flaw fc and state t £ fl, there is at most 1 arc a —> t, i.e., bf, < 1. 

Proof. Given r and C, to recover the previous state a it suffices fo extend the bicoloring in r of ef, to 
the rest of G (since G was bichromatic in a and only edges in C \ {ef, e^} were recolored). □ 

Thus, taking ^ to be uniform and p such that (17, p, p) is harmonic yields yc = 

Let R be the symmetric directed graph with one vertex per flaw where fc fc iff G n G' / 0. Since 
a necessary condition for fc to potentially cause fc is that G H G' / 0, we see that i? is a supergraph of 
the causality digraph. Thus, if we run the RECURSIVE WALK algorithm with input R, to apply Theorem 3 
we need to evaluate for each flaw fc a sum over the subsets of r/j(G) that are independent in R. To carry 
out this enumeration we observe that independence in R implies edge-disjointness which, in turn, implies 
that in each (independent) set of cycles to be enumerated, no edge of G appears in multiple cycles. Thus, to 
perform the enumeration it suffices to enumerate the subsets of edges of G that appear in the cycles and for 
each appearing edge e to enumerate all even cycles of length at least 6 containing e. 

Let g{k) = maxeg^; |{/c-cycles in G that contain e}|. If V’c = 'f’{\C\), then we can bound (3) as 
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1 + IE <7(2j)V’(2i) 
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(24) 
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We will prove the following struetural lemma relating degeneraey to g. 

Lemma 1. If G G Gd has maximum degree A, then g{k) < 2(4(iA)(^ 

We are thus left to ehoose f sueh that for every even ICI > 6, the r.h.s. of (24) is strietly less than 1. 
Taking 'ffk) = (SdA) 2 “ and Q = \ 16\/dA] we see that for all k > 6, 


(8dA)V 

7 ,_\ k-2 





3k I c 


< 1 


Regarding the running time, notiee that 6 > 1 — 2 ^ = 15/16 and that it ean easily be seen that that Tq is a 
polynomial in |£'|, A and the number of eolors used. 

Proof of Lemma 7. Fix any edge e = {u, n} G E. To enumerate the /c-eyeles eontaining e we will partition 
them into equivalenee elasses as follows. First we orient all edges of G arbitrarily to get a digraph D. 
Consider now the two possible traversals of the path C \ {u, n}, i.e., the one starting at u and the one starting 
at V. For eaeh traversal generate a string in {0,1}^“^ whose eharaeters eorrespond to sueeessive vertiees of 
the path, other than the endpoints, and denote whether the eorresponding vertex was entered along an edge 
oriented in agreement (1) or in disagreement (0) with the direetion of travel. Observe that eaeh of the /c — 3 
edges of G that have no vertex from {u, n} will ereate a 1 in one string and a 0 in the other. Therefore, at 
least one of the strings will have at least \{k — 3)/2] = {k — 2)/2 ones. Seleet that string, breaking ties 
in favor of the string eorresponding to starting at u. Finally, prepend a single bit to the string to designate 
whether the winning string eorresponded to u or to v. The string denotes C’s equivalenee elass. 

To enumerate all /c-eyeles eontaining e we ean thus enumerate all binary strings of length k — 1 and use 
eaeh string to seleet the k — 2 other vertiees of the eyele as follows: after reading the first eharaeter to deeide 
whether to start at u or at v, we interpret eaeh sueeessive eharaeter to indieate whether we should ehoose 
among the out-neighbors or the in-neighbors of the eurrent vertex. By the string’s eonstruetion, we will 
ehose among out-neighbors q> (/c — 2)/2 times. If Out and In are upper bounds on the out- and in-degree 
of D, respeetively, then the total number of eyeles per string (elass) is bounded by 

To eonelude the argument we note that sinee G G we ean direet its edges so that every vertex has 
out-degree at most d by repeatedly removing any vertex v of eurrent degree at most d (it always exists) and, 
at the time of removal, orienting its eurrent neighbors away from v. □ 
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A Mapping Bad Trajectories to Forests 


In this section, we show how to represent each sequence of t steps that does not reach a sink as a forest with 
t vertices, where the forests have different characteristics for each of the walks of Theorems 2, 3. 

A.l Forests of the Permutation Walk (Theorem 2) 

For S' C F, we denote by I.n{S) = I{S) the greatest element of S according to tt. We will sometimes write 
I {a) to denote I {U {a)). We first show how to represent the witness sequences of the Permutation Walk as 
sequences of sets. 

Let Bi be the set of flaws “introduced” by the f-th step of the walk, where a flaw fj is said to “introduce 
itself’ if it remains present after an action from A{j, ■) is taken. Formally, 

Definition 6 . Let Bq = U{ai). For 1 < i < t — I, let Bi = U{ai+i) \ {U{ai) \ /(o-j)). 

Let B* C Bi comprise those flaws addressed in the course of the trajectory. Thus, B* = Bi\ {Oi U iVj}, 
where Oi comprises any flaws in Bi that were eradicated “collaterally” by an action taken to address some 
other flaw, and iV* comprises any flaws in Bi that remained present in every subsequent state after their 
introduction without being addressed. Formally, 

Definition 7. The Break Sequence of a t-trajectory is B^,Bl,, B*_^, where for 0 < i < t — 1, 

Oi = {f e Bi\ 3j e [i + l,t] : f [/(fjj+i) A Vf G [f + l,j] : / / wi} 

Ni = {f ^ Bi I Vy G [f + 1, f] : / G U (fij+i) A Vf G [f + 1, f] : / 7 ^ w{\ 

B* = B, \ {Oi U Ni] . 

Given Bq, B{,..., B*_^ we can determine the sequence of flaws addressed wi,W 2 ,... ,Wi inductively, 
as follows. Define Ei = Bq, while for i > 1, 

Ei+i = {Ei - Wi) U B* . (25) 

By construction, the set Ei C U{ai) is guaranteed to contain Wi = I{a] = I{U{ai)). Since / = Ijr returns 

the greatest flaw in its input according to tt , it must be that iTriEi) = Wi. We note that this is the only place 

we ever make use of the fact that the function I is derived by an ordering of the flaws, thus guaranteeing that 
for every f G E and S' C F, if / (S) / / then I{S \ f) = I {S). 

We now give a 1-to-l map, from Break Sequences to vertex-labeled unordered rooted forests. Specifi¬ 
cally, the Break Forest of a bad f-trajectory S has | Bq \ trees and t vertices, each vertex labelled by an element 
of 1F(S). To construct it we first lay down |Fq| vertices as roots and then process the sets BfB^,... in 
order, each set becoming the progeny of an already existing vertex (empty sets, thus, giving rise to leaves). 

Break Forest Construction _ 

1 : Lay down |Fq| vertices, each labelled by a different element of Bq, and let V consist of these vertices 
2 : for i = 1 to f — 1 do 

3: Let Vi be the vertex in V) with greatest label according to vr 

4: Add \B*\ children to Vi, each labelled by a different element of B* 

5: Remove Vi from V ; add to V the children of n*. 


Observe that even though neither the trees, nor the nodes inside each tree of the Break Forest are ordered, 
we can still reconstruct W{T] since the set of labels of the vertices in V) equals Fj for all 0 < i < f — 1. 
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A.2 Forests of the Recursive Walk (Theorem 3) 

We will represent eaeh witness sequenee W = W (S) of the Reeursive Walk as a vertex-labeled unordered 
rooted forest, having one tree per invoeation of proeedure ADDRESS by proeedure ELIMINATE. Speeifieally, 
to eonstruet the Recursive Forest (/> = i;A(S) we add a root vertex per invoeation of ADDRESS by ELIMINATE 
and one ehild to every vertex for eaeh (reeursive) invoeation of ADDRESS that it makes. As eaeh vertex 
eorresponds to an invoeation of ADDRESS (step of the walk) it is labeled by the invoeation’s flaw-argument. 
Observe now that (the invoeations of ADDRESS eorresponding to) both the roots of the trees and the ehildren 
of eaeh vertex appear in W in their order aeeording to tt. Thus, given the unordered rooted forest i;A(S) we 
ean order its trees and the progeny of eaeh vertex aeeording to tt and reeover W as the sequenee of vertex 
labels in the preorder traversal of the resulting ordered rooted forest. 

Reeall the definition of graph G on F from Definition 3. We will prove that the flaws labeling the roots 
of a Reeursive Forest are independent in G and that the same is true for the flaws labelling the progeny of 
every vertex of the forest. To do this we first prove the following. 

Proposition 1. /jf ADDRESSfi, cr) returns at state r, then U{t) C U{a) \ (r/j(/j) U {/*}). 

Proof. Let a' be any state subsequent to the ADDRESS(i, cr) invoeation. If any flaw in U{a) n T R{fi) is 
present at a', the “while” eondition in line 8 of the Reeursive Walk prevents ADDRESS(i, a) from returning. 
On the other hand, if fh G PRifi) \ U{a) is present in a', then there must have existed an invoeation 
ADDRESS(y, a”), subsequent to invoeation ADDRESS(i, a), wherein addressing fj eaused fh- Consider the 
last sueh invoeation. If a"' is the state when this invoeation returns, then 0 U{a'"), for otherwise the 
invoeation eould not have returned, and by the ehoiee of invoeation, is not present in any subsequent state 
between a'" and r. □ 

Let ([i], (T*) denote the argument of the i-th invoeation of ADDRESS by ELIMINATE. By Proposition 1, 
{17(iT*)}j>i is a deereasing sequenee of sets. Thus, the elaim regarding the root labels follows trivially: for 
eaeh i > 1, the flaws in PR{fi) U fi are not present in and, therefore, are not present in U{a^), for 
any j > i -|- 1. The proof for the ehildren of eaeh node is essentially identieal. If a node eorresponding 
to an invoeation of ADDRESS has q ehildren, eorresponding to q (reeursive) invoeations with arguments 
{(a*, then the sequenee of sets {U is deereasing. Thus, the flaws in rj:j(ai) U {aj} are not 

present in and, therefore, not present in U{t^), for any j >i + P 
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